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Abstract 

Only a small proportion of genetic variation in serum ferritin lias been explained by variant genetic studies, and genome- 
wide association study (GWAS) for serum ferritin has not been investigated widely in Chinese population. We aimed at 
exploring the novel genetic susceptibility to serum ferritin, and performed this two stage GWAS in a healthy Chinese 
population of 3,495 men aged 20-69 y, including 1,999 unrelated subjects in the first stage and 1,496 independent 
individuals in the second stage. Serum ferritin was measured with electrochemiluminescence immunoassay, and DNA 
samples were collected for genotyping. A total of 1,940,243 SNPs were tested by using multivariate linear regression 
analysis. After adjusting for population stratification, age and BMI, the rs5742933 located in the 5'UTR region of PMSI gene 
on chromosome 2 was the most significantly associated with ferritin concentrations (P-combined = 2.329xl0~^°) (P= — 
0.11, 95% CI: -0.14, -0.07). IVloreover, this marker was about 200kb away from the candidate gene SLC40A1 which is 
responsible for iron export. PMSI gene was the novel genetic susceptibility to serum ferritin in Chinese males and its 
relation to SLC40A1 needs further study. 
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Introduction 

Ferritin, one of the key proteins regulating iron homeostasis, is a 
widely available clinical indicator to evaluate iron status [1]. As a 
serum marker of iron status, elevated serum ferritin is not only 
used to screen for iron overload, but also applied to predict 
metabolic syndrome and type 2 diabetes [2-5] . Growing evidences 
suggested that genetic factors contributed 20-30% of the variation 
to blood iron concentrations [6-9]. However, only a small 
proportion of genetic variation in serum ferritin has been 
explained by variant genetic studies [10]. It was well known that 



the two subunits of ferritin were synthesized under the control of 
different genes in chromosomes 1 1 and 19, respectively [1 1,12]. As 
most cases of genetic hemochromatosis were associated with the 
C282Y (a cystine to tyrosine mutation at position 282) [10], Milet 
et al reported firstly that BMP2 gene was associated with serum 
ferritin in C282Y homozygotes patients [13]. Later, it was 
demonstrated that TMPRSS6 polymorphisms were significandy 
associated with ferritin as well [14]. 

The genome-wide association study (GWAS) is a powerful and 
unbiased tool for the identification of common genetic variants 
associated with complex traits. To the best of our knowledge, there 
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have been two GWASs for serum ferritin, both of which were 
performed on Australian samples [10,15]. In the first GWAS on 
adult female monozygotic twins, genes TF and HFE were 
associated with serum ferritin [10], while the other GWAS on 
adolescent and adult individuals from twin families reported the 
association of serum ferritin with gene TMPRSS6 [15]. Moreover, 
in a meta-analysis of two GWASs in both semi-isolated and 
outbred populations from Italy and the USA, the HFE locus has 
been identified to be associated with serum ferritin, and the 
involvement of TF, TMPRSS6 and HFE genes in the mainte- 
nance of iron homeostasis was confirmed [16]. Recently, in a 
candidate gene study of Chinese Hans, two variants of TMPRSS6 
gene (V736A and D521D) were confirmed to be associated with 
ferritin concentrations, but the TF and HFE genes were not 
studied [14]. Considering that only a small proportion of genetic 
variation has been identified for serum ferritin, and there were 
diversities in allele and genotype frequencies in different ethnic 
populations, it is necessary' to (explore common genetic variants 
associated with the serum ferritin in Chinese. Thus, we conducted 
this two-stage GWAS in a healthy Chinese male population in 
search of population-specific genetic variations associated with 
serum ferritin. 

Methods 

Study population 

A two-stage GWAS was performed to identify the genes/loci 
that influence serum ferritin concentrations. 

Stage 1 of the GWAS included 1,999 unrelated healthy Chinese 
men aged 20-69 years old from the Fangchenggang Area Male 
Health and Examination Survey (FAMHES). The FAMHES is 
described elsewhere [17]. Briefly, it was designed to investigate the 
effects of environmental and genetic factors. All subjects were free 
of stroke, primary hypertension, diabetes meUitus, rheumatoid 
arthritis, hyperthyroidism, tumors, coronary heart disease, and 
hepatic or renal dysfunction. AU men who participated in physical 
examinations in the Medical Centre of Fangchenggang First 
People's Hospital from September 2009 to December 2009 were 
invited to participate in the study (n = 4,364). A total of 4,303 
participants (98.6%) provided informed consent and blood 
samples. There were 2,012 people randomly selected from 
southern Chinese Han ethnicity. After selected by age criteria, a 
total of 1,999 individuals passed the call rate of 95% and were used 
in the final statistical analysis. 

Stage 2 of the GWAS consisted of 1,496 healthy Chinese men 
aged 20-69 years old. They were randomly selected from male 
participants who participated in physical examinations from 
September 2009 to September 2010 in the Medical Centre of 
Fangchenggang First People's Hospital, Guigang People's Hospital 
and Yulin First People's Hospital. Stage 2 samples from 
Fangchenggang First People's Hospital were independentiy 
recruited from the stage 1 samples. In stage 2,996 were of Han 
ethnicity and 500 were of Zhuang ethnicity. The same recruitment 
strategy was used in stages 1 and 2. 

Comprehensive health information was collected through 
clinical examination, and additional demographic information 
was obtained via a standardized questionnaire. We obtained 
written documentation of informed consent from all study 
participants, and the research protocol was approved by the local 
ethics committee. Both smoking and drinking in two stages were 
assessed on the basis of a self-administered life-style questionnaire 
according to the same protocol. Respondents that reported 
smoking currendy (daily smoking >6 months) were coded as 
smokers, and those reported drinking any beverage 'more than 



once a year' were coded as drinkers, whereas others were non- 
drinkers [18]. The study was approved by the Ethics and Human 
Subject Committee of Guangxi Medical University, 

Measurement of serum ferritin 

The description of the laboratory test has been previously 
reported in detafl [19]. Briefly, about 10 ml overnight fasting 
venous blood specimens were collected between 8:00 and 11:00 
am and were transported frozen to the testing center of 
Department of Clinical Laboratory at the First Affiliated Hospital 
of Guangxi Medical University in Nanning in two hours, which 
were centrifuged within 15 to 25 min and stored at — 80°C until 
analysis. Ferritin was measured with electrochemiluminescence 
immunoassay on COBAS 6000 system E601 (Elecsys module) 
immunoassay analyzer (Roche Diagnostics, GmbH, Mannheim, 
Germany) with the same batch of reagents, and the inter-assay 
coefficient of variation was 3.4%. 

Genotyplng 

In our study, two diflerent platforms were used for single 
nucleotide polymorphism (SNP) genotyping. For stage 1, geno- 
typlng was performed by using the lUumina Omni 1 platform. The 
Sequenom iPLEX system (Sequenom, Inc., San Diego, CA, USA) 
was used in stage 2. Polymerase chain reaction and extension 
primers were designed using Mass ARRAY Assay Design 3.1 
software (Sequenom, Inc.). Manufacture's iPLEX Application 
Guide (Sequenom, Inc.) was performed for genotyping proce- 
dures. All of the genotyping reactions were performed in 384-weU 
plates. Each plate included a duplicate for three or four 
participants selected at random, as weU as six to nine negative 
controls in which water was substituted for DNA. The average 
concordance rate was 99.8%. 

Statistical analysis 

Quality control procedures were applied to 1,999 unrelated 
individuals that were genotyped using the lUumina Omni-Express 
platform [17]. Total 1,999 samples passed the caUrate of 95% and 
were included in the final GWAS analysis. We then applied the 
following QC criteria to filter SNPs: P<0.001 for the Hardy- 
Weinberg equilibrium test, minor allele frequency <0.01 and 
genotype call rate <95%. Based on these criteria, 709,211 SNPs 
were retained. The IMPUTE computer program [20] was then 
used to infer the genotypes of SNPs (e.g. SNPs catalogued in 
Hapmap Phase II CHB population release #24) in the genome 
that was not directly genotyped. A posterior probability of >().9() 
was applied to call genotypes that were imputed using IMPUTE 
software. After applying the same QC criteria, as used above, a 
total of 1,940,243 SNPs remained in the final analysis. Analysis for 
ferritin was performed on log-transformed values. Linear regres- 
sion implemented in PLINK [21] was used to estimate the SNP 
association under the assumption of an additive relationship 
between the number of copies and the residual log-transformed 
ferritin value. Population stratification was estimated by a 
principal component approach, as implemented by EIGEN- 
STRAT software [22] Clinical covariates utilized in the linear 
regression modeling included age at the time of ferritin measure- 
ment, body mass index (BMI, weight in kg divided by the height in 
m^). For regions with multiple SNPs that were significant at P< 
10~^, multivariate linear regression analysis was applied to test the 
independence of the respective SNPs. Only the SNPs that 
remained significant at 10 ''in the multivariate analysis were 
selected. The combined analysis of two-stage data was performed 
using a linear regression, adjusting for the covariates (population 
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stratification, age, BMI and stage information). The P coefficient 
and 95% confidence internal (95% CI) was reported. 

Results 

The general cliaracteristics of the samples in this study were 
described in Table 1. There were 1,999 participants in stage 1 
and 1,496 participants in stage 2. No significant difference was 
observed between the two stages in age distribution (37.5 versus 
37.3 years, P = 0.54), BMI (23.3 versus 23.5 kg/m\ P = 0.18) and 
smoking behavior (P = 0.66), excepting for alcohol consumption 
(P — 0.02). As showed in Figure 1, the QuantUe-Quantile plot of 
adjusted P values with the inflation factor of 1.01 show no 
systematic bias. When the top two Eigens were added to other 
covariates in the GWAS analysis, similar results were obtained. 
The inflation factor indicated that there was no population 
substructure in the GWAS analysis. The genome-wide association 
results were presented in the Manhattan plot (Figure 2). 

In stage 1, we performed the multivariate regression analysis 
adjusting for population stratification, age, BMI, and selected the 
SNPs with P< 1.0x10"'^. As showed in Table 2, we totaUy 
identified nine loci on chromosomes 2. The rs5742933, located in 
the end of 5'UTR region of gene postmeiotic segregation 
increased 1 (PMSl) was the most significantly associated with 
ferritin concentrations (P = 4.699 x 10"") (P=-0.05, 95% CI: - 
0.07, —0.03). It was also in strong linkage disequilibrium (LD) with 
the other eight SNPs (all both and D'>0.9), not only located in 
PMSl but also in ANKAR, OSGEPLl and ORMDLl. According 
to LD and hap-block of SNPs, only rs5742933 was selected to be 
further confirmed. 

In stage 2, the association between rs5742933 and serum ferritin 
was validated by examining 1,496 healthy subjects. The rs5742933 
reached with a P-value of 6.777x10"* (P=-0.09, 95% CI: - 
0.14, —0.04) in the second stage after adjusting for the same 
covariates. It remained significantiy associated with ferritin 
concentrations after further adjusting for the stage information 
(combined-P = 2.329x10"'") (P=-0.11, 95% CI: -0.14, - 
0.07). Interestingly, rs5742933 was about 200 kb away from the 
candidate gene SLC40A1 as showed in Figure 3. The genotypes 
of rs5742933 showed significant association with ferritin (P< 
0.001); serum ferritin basically decreased with increasing numbers 
of minor allele C (Table 3). However, the genotypes of rs5742933 
did not show statistically significant association with smoking or 
alcohol consumption (both P>0.05). 




0 2 4 6 8 

-loQio (Expected P value) 

Figure 1. Quantile-Quantile plot of genome-wide quantitative 
trait loci mapping for log-transformed serum ferritin concen- 
trations. 

doi:1 0.1 371/journal.pone.01 05844.g001 

Discussion 

Serum ferritin plays an important role in clinical researches; 
however, comprehensive genetic assessments of the variability in 
ferritin remain poorly studied. We performed this two-stage 
GWAS in 3,495 Chinese adult males in search of the genetic 
variations associated with serum ferritin. We found that the 
rs5742933 located in the 5'UTR region oi PMSl gene was the 
most significanti SNP associated with serum ferritin, which have 
never been reported before in any population. 

The rs5742933 is in the 5'UTR of PMSl gene that is a DNA 
mismatch repair gene. This SNP was predicted as an exonic 
splicing silencer that may inhibit or silence splicing of the pre- 
mRNA, or as a transcription factor binding site that may affect the 
level, location, or timing of gene expression [23]. As one of tiie 
potentially functional polymorphisms in PMSl, the rs5742933 
may serve as candidate prognostic marker of clinical outcome of 
non-small-cell lung cancer [24]. PMSl was expressed in many 
tissues including haematopoietic cells [25]. In a study of Japanese 
children with aplastic anemia (AA), IgG antibodies against PMSl 
were detected in 10% patients (3/30), while no antibody responses 
to PMSl in normal volunteers [26]. In another multicentre study 
of more children with AA, PMSl antibodies were deselected in 
14.6% (15/103) of patients, nevertheless, but the role of PMSl 
antibodies in AA has not yet been determined [27]. Besides, the 
PMSl gene was also associated with systolic blood pressure and 
hypertension in a GWAS of 29,136 participants from six large 



Table 1. General characteristics of participants in the two-stage GWAS. 





Characteristics 


First stage 


Second stage 


P-value" 


n 


1,999 


1,496 




Age, years(mean±SD) 


37.5±11.1 


37.3±10.8 


0.54 


Smoking, n(%) 






0.66 


Yes 


1015(50.8) 


771(51.5) 




No 


984(49.2) 


725(48.5) 




Alcohol drinl<ing, n(%) 






0.02 


Yes 


1 704(85.5) 


1165(82.6) 




No 


288(14.5) 


246(1 7.4) 




Body mass index {l<g/m^) 


23.3±3.44 


23.5±3.34 


0.18 



'^T-test was used to compare means of the continuous variables between the first and the second stage; Chi-square test was used to compare the differences for 

categorical variables. 

doi:1 0.1 371/journal.pone.Ol 05844.t001 
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Figure 2. Manhattan plot of genome-wide association analysis for serum ferritin concentrations. X-axis shows chromosomal positions. 
Y-axis shows -loglO P-values from the linear regression adjusting for population stratification, age and BMI and stage information. Candidate gene 
names are showed for the significantly associated loci. 
doi:1 0.1 371 /journal.pone.01 05844.g002 



prospective observational studies in the CHARGE Consortium 
[28]. Although no previous studies examined the relationship 
between PMSl and ferritin, our finding may indicate the 
involvement of DNA mismatch repair in genetic control of ferritin 
concentrations. 

Interestingly, the PMSl gene was near Ferroporlin 1 
{SLC40A1), which is also known as HFE4 or SLC11A3. The 
protein encoded by this gene is a cell membrane protein that may 
be involved in iron export from duodenal epithelial cells. Defects 
in this gene are a cause of hemochromatosis type 4. The A77D 
mutation and the Vail 62 deletion in the gene SLC40A1 would 
lead to hyperferritinaemia and reticuloendothelial iron overload 
[29]. The SLC40A1 gene silencing in human macrophages would 
induce iron retention and enhance ferritin synthesis [30]. 
Moreover, the SLC40A1 gene may interact with the HFE, 
TFR2 genes, causing hyperferritinemia and an iron overload 
phenotype [31]. In the present study, the most significant SNP was 
located in the 5'UTR of PMSl and in strong LD with SNPs 
covermg PMSl (both and D'>0.99), but in considerably weak 
LD with SNPs covering SLC40A1 (both R^ and D'<0.20). These 
results suggested that PMSl, rather than SLC40A, was associated 
with serum ferritin, although these two genes were much closed. 
Nevertheless, we can not rule out the possibility of epigenetic effect 
between PMSl and SLC40A1 in the genetic control of serum 
ferritin. 

Although we found the PMSl was significantly related with the 
serum ferritin concentrations in Chinese males, we failed to report 
the other loci previously identified in other populations, such as 
HFE, BMP2, and TMPRSS6. Firstly, HFE was associated with 
serum ferritin in the GWAS for adult female monozygotic twins 
from Austraha [10], consistent with the previous study [32,33]; 
however, our study failed to confirm this. According to HapMap 
data, the C282Y and H63D mutation of HFE gene is common in 
Europeans but rare in Chinese. This characteristic of HFE may 
partly explained our negative results. Secondly, the BMP2 was 
associated with ferritin in the previous GWAS for elderly Chinese 
women with iron-deficiency anemia [34]. We failed to replicate 
the same result probably due to the difference in serum ferritin 
levels between elder females with anemia and middle-aged healthy 



males. Thirdly, TMPRSS6 was associated with serum iron and 
hemoglobin concentrations in several GWASs for Europeans 
[15,35,36]. The association of TMPRSS6 with serum ferritin was 
confirmed in Chinese [14], so was its association with hemoglobin, 
iron and transferrin saturation concentrations [34] . However, in a 
GWAS for Australian twin families, TMPRSS6 was associated 
with serum ferritin with a borderline P value around 10~ [15]. 

The limitations of our study should be evaluated objectively. 
Firstly, the sample size is probably not sufficient enough to make 
some genuine SNPs reach the GWAS significant level. Further 
study in Chinese with a larger sample size or meta-analysis of 
GWASs in different populations is recommended. Secondly, the 
participants in our study were adult males from general 
population, which might lead to a relative selection bias. Further 
study in females or children with AA may enhance our 
fmdings.Thirdly, only one SNP in one gene was selected to be 
further tested, so some genuine genes in the potential pathways 
may be neglected. Further well-designed biological experiments 
combined with bioinformatic analysis were recommended, espe- 
cially the quantitative real-time polymerase chain reaction (qRT- 
PCR). Last but not the least, the role of environmental 
backgrounds or gene-environment interactions cannot be ignored 
either. 

Conclusions 

In summary, to our knowledge, we are the first to perform the 
two-stage GWAS in Chinese male population to explore the 
genetic influence on serum ferritin concentrations. Our study 
observed that the rs57429331ocated in PMSl was the most 
significant SNP associated with serum ferritin, which suggested 
that PMSl may be a susceptibility gene affecting serum ferritin 
concentrations in Chinese male population. The candidate gene 
SLC40A1 were closed to gene PMSl, so the underlying relation 
between the SLC40A1 and PMSi needs further study. Our results 
may partiy contribute some evidence to the genetic control of 
serum ferritin or iron status. 
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Figure 3. Association of serum ferritin concentrations with SNPs at chromosome 2. X-axis shows base positions from 190,414 Kb to 
190,357 Kb. Y-axis shows -loglO P-values from linear regression adjusting for population stratification, age and BMI and stage information. Ferritin 
concentrations were log-transformed and fit for a normal distribution. The bottom panels describe all genes in the region. 
doi:10.1371/journal.pone.0105844.g003 
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